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Abstract 

We investigate a variant of hypercubic gauge link smearing where the SU{3) projection is replaced 
with a normalization to the corresponding unitary group. This smearing is differentiable and thus 
suitable for use in dynamical fermion simulations using molecular dynamics type algorithms. We 
show that this smearing is as efficient as projected hypercubic smearing in removing ultraviolet 
noise from the gauge fields. We test the normalized hypercubic smearing in dynamical improved 
(clover) Wilson and valence overlap simulations. 
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I. INTRODUCTION 

In recent years, significant progress has been made in full QCD lattice simulations. There 
are simulations with 2+1 flavors, with realistic quark masses, and in large volumes, though 
frequently only two of the three conditions are met at once. These simulations are performed 
with difl'erent kinds of fermion formulations, from the simplest unimproved Wilson fermions 
to highly improved nearly chiral fermions, with improved rooted staggered fermions and 
even with the expensive but exactly chiral overlap fermions. All these calculations, even 
those with inexact chiral symmetry, are still expensive and require large computer resources. 
Improving the fermionic action such that simulations could be performed on coarser lattices, 
or improving the performance of algorithms to better fit today's computer power is important 
for truly realistic simulations. It seems that a simple modification, the use of smeared gauge 
fields in the fermionic action, can help improve both the action and the computational 
performance as well. 

Smeared links are a natural part of improved fermionic actions. In the perfect action 
formulation the Dirac operator at the renormalization group fixed point is fitted by an 
extended but ultra-local Dirac operator. This fit is not feasible unless the gauge links of 
the Dirac operator are smeared [l|. The exactly chiral overlap operator ^ effectively also 
contains smeared links, even if the kernel operator is based on thin links. This can be seen 
when one considers the expanded form of the overlap formulation with the square root term 
in d{—Ro)/ ^yd^{—Ro)d{—Ro). The order d^, d^, etc. terms all contribute to the nearest 
neighbor fermion coupling of the overlap Dirac operator with extended gauge connections. 
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The most frequently used staggered fermion formulation, the so called Asqtad action, also 
uses fat links [3j. The smeared links discussed above are part of the definition of the Dirac 
operator. The gauge action is independent and in most cases is not smeared. 

The effect of smearing is two-fold. First, it averages out small scale vacuum fluctuations, 
reducing the non-physical ultra-violet noise in the fermionic action, secondly it removes 
extreme local fluctuations of the gauge flelds, lattice dislocations. In the various fermion 
discretizations, the effect of vacuum fluctuations and dislocations comes in different dis- 
guises. Staggered fermions' taste breaking is triggered by gauge fleld fluctuations within 
the hypercube and smearing can effectively reduce this effect jj, Q|. For Wilson fermions 
dislocations contribute to the spread of the near zero real modes of the Dirac operator. 
Those modes make it impossible to simulate at small quark masses without going to very 
fine lattice spacing, and/or large volumes Smearing removes the dislocations and reduces 
the spread of the eigenmodes |6|. 

Chiral fermions can also benefit from smeared links. The cost of the overlap operator is 
largely given by the density of low modes of the kernel operator from which it is constructed. 
Smearing reduces the occurrence of these low modes and thereby can reduce the cost of 
applying the operator by an order of magnitude 0, S|- In simulations using domain wall 
fermions, the low modes of the kernel operator are known to cause explicit breaking of chiral 
symmetry, indicated by a non-vanishing residual mass. If there are fewer of those modes, 
chiral symmetry is realized to a higher degree and one can use a smaller fifth dimension 
without increasing the residual mass. 

There is no unique criterion what constitutes a "good" smearing procedure besides the 
explicit construction of the fixed point Dirac operator or the expanded form of overlap 
fermions. Without such guiding principles, any smearing, as long as it consists of adding 
irrelevant (local) operators to the action, is acceptable. The smeared links do not even have 
to be SU{3) elements as is illustrated by the success of the Asqtad action. Any acceptable 
procedure will lead to a valid action, but chosen properly, smearing will improve the scaling 
of the continuum limit. If the modification of the gauge fields are too weak, the smearing 
has no effect. On the other hand, a definition of the fat link which spreads over many sites 
and heavily mixes the links can lead to an action which again has strong cut-off effects j^. 
Thus, an optimal smearing is as local as possible while removing as much of the short scale 
fluctuations as possible. 

The flrst smearing was introduced by the APE collaboration and different forms 
of smearing have been used in quenched studies since then. Dynamical simulations with 
smeared links became practical when the fully differentiable stout smearing was proposed 
by Morningstar and Peardon [uj. Iterating either APE or stout links can wash out short 
to intermediate scale physical properties of the action, leading to large scale violations in 
quantities sensitive to those scales. Hyper-cubic (HYP) blocking, introduced in Ref. 
circumvents this problem by reducing the spread of consecutive smearing steps. In this 
paper, we will discuss variants of the HYP blocking that are differentiable and suitable for 
molecular dynamics simulations. 

In the next Section we first modify the APE construction by replacing the original SU{3) 
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projection by a normalization to U{3). These normalized n-APE links are differentiable and 
as effective in removing short scale vacuum fluctuations as the projected APE smearing. 
Next we combine n-APE smearing with the HYP definition and show that n-HYP links are 
as effective as 3 levels of stout smearing and are considerably better than HYP links con- 
structed from stout smearing. The differentiable n-HYP smearing can be used in dynamical 
simulations and in Sect. [1111 we give details of how the fermionic force can be evaluated 
with n-HYP smearing. This force term can be combined with any fermionic action and in 
Sect. HVlwe illustrate the effectiveness of the smearing both with overlap and Wilson clover 
fermions. 



II. DEFINITION OF THE SMEARED LINKS 



The APE smeared link [10| is the basis of most smearing methods. First the staple sum 
^n,f, = Y^u^p, Un,uUn+u,fiUl+^^^ is added to the original link f4,^ as 

^n,fi = (1 - a)^n,M + «' Tn.M • (l) 

Here a' = a/m and m is the number of staples included in the staple sum. Next fi, a general 
N X N matrix, is projected back to SU{N) as 

V„ = max Re tr (Vn^) . (2) 

^ V£SU{3) ^ 

In the following we will refer to this construction as projected- or p-APE. Since no closed 
form for the derivative of the p-APE links is known, they are difficult to use in molecular 
dynamics (MD) simulations. 



Not long ago Peardon and Morningstar suggested a differentiable smearing method 
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Their construction uses the staple sum r„^^ to define the differentiable SU{N) stout link as 

Vs = eP'U, (3) 

s = ^(rt/t - t/rt) - ^tr(rf/t - ur^) . (4) 

It is not obvious why the suggested form is a smearing at all beyond the perturbative regime 
where TW ^ nl. There the stout links are indeed identical to projected APE smeared 
links with p = a/Q Nevertheless stout smearing appears to work similarly to APE well 
beyond the perturbative regime. 

Here we consider a smeared link that is closer in spirit to the projected APE links but it 
is differentiable and appropriate for MD simulations. From the N x N general fl matrix of 
Eq. ^ we form a U{N) unitary matrix as 

K = . (5) 

Since il^il is Hermitian and positive definite, is well defined, unless detil = 0. 

The smeared link Vn is unitary but not in SU{N), its determinant in general is not one. 
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The form in Eq. (I5|) was first used in Ref. [l3| to define smeared operators, while in Ref. [14 



Vn is divided by the cube-root of its determinant to define an SU{N) link. 

One should note that there is no requirement that the smeared link be an SU (N) element, 
but in practice projecting the link back to SU{N) was found to be more effective in removing 
short scale fluctuations. Here we will show that the U{N) element Vn link is as effective as 
the projected smeared link. In the following we will refer to the Vn links defined in Eq. ((51) as 
normalized- or n-APE smearing. Since at the 1-loop perturbative level neither the projection 
nor the normalization of the link plays any role, the 1-loop perturbative properties of all 
three smearing prescriptions are identical. 

HYP smearing, as introduced in Ref. consists of three consecutive projected APE 
type smearing steps but the staple sums at the higher level are constructed such that only 
links within the hypercubes attached to the original link enter. The consecutive smearing 
levels are constructed as 



K,M = Pr0j5^(3)[(l - ai)Un,p + y 5^ Vn,u;pVn+0,p-,yV^^^^^.J , (6) 

Vn,p-u = Proj5f/(3)[(l - a2)?7n,Ai + y 2^ ^ n,p;u pV n+p,fM-puV n+f^^p-u p] ^ (7) 

±p^u,p. 

Vn,p;up = Pr:0isui3){i'^- (^3)Un,p + Y 2^ Un,r,Un+f,,^JJl+j,^^] ■ (8) 

±ri^p,v,p 

The Un,p are the thin links from site n in direction /i, the Vn,p are the resulting HYP blocked 
fat links. The intermediate fields V and V are constructed such that the contributions to V 
are restricted to the attached hyper-cube. The indices after the semi-colon always indicate 
the directions excluded from the sums. The three SU(3) projections make the HYP smeared 
configurations very smooth while keeping the smearing within a hypercube ensures that 
even short distance properties of the configurations are only minimally distorted. While the 
main ingredient, the SU(3) projections, make the HYP links difficult to use in dynamical 
simulations, any of the above discussed differentiable smearings can be combined with the 
HYP construction. In the following we will refer to the original HYP links as p-HYP, to the 
normalized smearing as n-HYP and the stout HYP construction as stout - or s-HYP. Again, 



at the 1-loop perturbative level the three descriptions are identical 
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A. Stout and n-APE smearing in SU(2) 

The two smearing prescriptions are easiest to compare for the gauge group SU(2). The 
relevant quantity for both is which is a linear combination of SU(2) elements and can 
be written as 

TU^ = ujQl + i uoa, (9) 
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where uq and u = hoj are real. For SU{2) the traceless anti-Hermitian part of VU\ S in 
Eq. ([4]), is just iua and we thus have 

Vs = e'P'^^U = [cosipcu) + i sin(ptu) na]U, (10) 

while the APE link ([T]), normalized according to Eq. ([5]) is 



Vr 



n 



^7 ^ ^T^^^ 



U, (11) 



iVo = ./{l+I^JofH^ , (12) 

where ^ = a' /{I — a). The stout link is independent of ujq, the trace of TW , and thus 
contains less information about the original fields than the n-APE link. 

Eqs. (fTOl) and ffTTjl can nevertheless be approximately identical if pu, ^ 1, and ujq 
can be replaced by its average value. According to Eq. ([9]) c^o is related to the trace of the 
plaquettes around the thin link f/, so {uq) = mtr(f/piaq)/A^ = m + 0(c<j^) when u <^ 1. 
These conditions are satisfied near the continuum limit where fiuctuations are suppressed 
and the gauge links are close to the unit matrix. Then stout and n-APE links agree if 
C,uj/No ^ sin(pco'), or 

C a/m 
^ = 1 + i{uJo) ^ 1 - a(l - tr(f/pi,q)/Ar) " ^'^^> 

This relation agrees with the perturbatively expected form p = a/m if tr([/piaq) = A^. On 
typical MC configurations the plaquette is considerably smaller than that, suggesting that 
even if stout and n-APE smearing can be matched on MC configurations, the corresponding 
stout parameter could be significantly different from the perturbatively expected value. 
While the optimal parameter for APE smearing is largely independent of the gauge coupling, 
this is not so for stout smearing. On rough configurations where puj is not small and Uq 
cannot be replaced by its average, stout links could be very different from n- or p-APE links 
and resemble little the form of Eq. ([T]) . 



B. Comparing projected, normalized and stout smearings 

Smearing reduces lattice artifacts by removing some of the non-physical ultraviolet fiuc- 
tuations of the gauge configurations. The effectiveness of the smearing can be measured by 
the smoothness of the plaquette, i.e. by the value of the average plaquette, and even more 
so by the distribution of the smallest plaquette on finite volume configurations. 

The comparisons presented in this section are based on a set of 500 quenched 8"^ lattices 
generated with the plaquette gauge action at /3 = 5.8, corresponding to a lattice spacing 
of 0.136 fm. In Fig. [T] we show the average plaquette after one level of p-APE, n-APE 
and stout smearing as a function of the smearing parameter a. The values measured after 
projected and normalized APE smearing are nearly indistinguishable, predicting the best 
smearing at about a = 0.75. Above this value the smearing becomes unstable, the average 
plaquette drops even with only one level of smearing. The stout smeared plaquette is plotted 
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Figure 1: The average plaquette on quenched /? = 5.8 configurations as a function of the smearing 
parameter a after various single level smearings. 
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Figure 2: The histograms show the distribution of the smallest smeared plaquette. The left panel 
compares projected (lines) and normalized (shaded) APE with a = 0.75 smearing parameter. The 
right panel compares n-APE (shaded) with a = 0.75 and stout smearing with Qp = 1.1 (lines). 



in two different ways: once with the perturbatively predicted relation a = 6p, and also with 
the relation based on the SU{2) prediction of Eq. f |T3l) . While the former parametrization 
leads to a very different result than the APE smeared links, the latter one is surprisingly 
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consistent with those 

The most extreme fluctuations can be studied from the tail distribution of the plaquette. 
Figure [2] shows the histogram of the smallest plaquettes. The left panel compares p-APE 
and n-APE smearings at the same a = 0.75 parameter value. It is surprising how small the 
deviation is between the two smearings even here when individual plaquettes are considered. 
If any difference is observable, it is to the advantage of the n-APE smearing in the sense 
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Smearing 


(trC/p) 




Ix stout 


2.60 


-0.78(1) 


2x stout 


2.84 


-0.16(2) 


3x stout 


2.91 


0.46(3) 


s-HYP a) 


2.80 


-0.39(2) 


s-HYP b) 


2.64 


0.00(5) 


n-HYP 


2.82 


0.38(3) 



Table I: Comparison of the plaquette and the minimum plaquette values. The different smearings 
considered are: 1, 2 and 3 levels of stout smearing with 6p = 0.9; stout-HYP a) with parameters 
6p = (0.85, 0.75, 0.35) ; b) with parameters 6p = (1.2, 1.0, 0.4); n-HYP with standard HYP 
parameters a = (0.75, 0.6, 0.3); 



that the latter produces slightly larger minimal plaquette values. The right panel compares 
a = 0.75 n-APE and stout smearing at its optimal value, 6p = 1.1. The difference is obvious, 
n-APE removes more of the extreme fluctuations than stout smearing. Stout smearing with 
6p = 0.75 is considerably worse than n-APE smearing. 

Next we consider HYP links based on the three different smearings. In Ref. jl| the 
projected-HYP parameters were optimized by maximizing the smallest plaquette on a set 
of coarse (/? = 5.7) configurations. The optimal parameters found that way (ai = 0.75, 
a2 = 0.6 and = 0.3) turned out to be fairly independent of the gauge coupling and close 
to the perturbative values that minimize taste violations for staggered fermions (ai = 0.875 
a2 = 0.571 and as = 0.25). Since we found that n- and p-APE smearing are nearly iden- 
tical numerically and they are identical perturbatively, we expect that the same parameter 
values are optimal for n-HYP as well. To optimize the stout-HYP parameters we repeated 
the procedure of Ref. We found it difficult to identify an optimal parameter set, the 
sensitivity, especially to the last parameter p^, is weak compared to statistical fluctuations. 
The best parameter values were large, even larger than what one would predict based on 
Eq. (fT3ll . and did not remove as many of the small plaquettes as n-HYP smearing. 

In Table [T] we compare the average plaquette and the average of the minimum plaquette 
values. In addition to n-HYP smearing with parameters a = (0.75, 0.6, 0.3) we consider 
1, 2 and 3 levels of stout smearing with Qp = 0.9 smearing parameter, s-HYP smearing 
with parameters 6p = (0.85, 0.75, 0.35) and with parameters 6p = (1.2, 1.0, 0.4). The 
former s-HYP parameters correspond to n-HYP parameters rescaled according to Eq. (fT3l) . 
the latter one to the values found by optimizing the minimum plaquette distribution. The 
average plaquette value does not always follow the minimum plaquette. Based on the average 
plaquette one would expect that 2 levels of stout smearing are about the same or better than 
n-HYP. This expectation is false as we will show in Sect. HV] . The minimum plaquette is 
a much better indicator of the quality of smearing. That observable puts n-HYP close to 3 
levels of stout and considerable better than s-HYP even with the optimized b) parameter 
set. 

To summarize our observations, we expect normalized-HYP to be as good as projected- 
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HYP with the same parameter values. The n-HYP parameters do not have to be changed 
with the gauge coupling and perturbative corrections are expected to be small. If stout-HYP 
smearing is used in numerical simulations, the parameters will have to be tuned depending 
on the gauge coupling. Stout-HYP smearing with parameters tuned that way is effective 
in removing average fluctuations though it does not work as well in removing the extreme 
fluctuations. At one-loop perturbation theory all three smearings are identical, but since the 
optimal stout parameters at large or moderate lattice spacing are well above the perturbative 
values, one expects larger perturbative corrections for stout links. In dynamical updates 
the computational overhead for n-HYP and stout-HYP is similar, therefore overall n-HYP 
appears to be a better choice for simulations. In the following we describe the implementation 
of n-HYP smearing in dynamical simulations. 



III. FORCE OF THE NHYP LINK 



The equations of motion which are approximately solved in the molecular dynamics evo- 
lution derive from d?-^/dr = 0, where Ti = /2 + Sf + Sg is the molecular dynamics 
Hamiltonian. The computation of the fermion contribution to the derivative is subject of 
this section. We denote the part of the fermionic action that depends on the smeared links 
by Scsiy) and assume its derivative with respect to the V links has already been per- 
formed. We now describe how to use the chain rule to compute the derivative with respect to 
the thin links. In our discussion we follow closely Ref. [ll||. We start out with the derivative 
of S'eff with respect to the simulation time parameter r 

d . - — •" " - - ^ (14) 



dr 



eff 



Retr^^^^ = Retr . 



SV^ dr 



Here V = dV/ dr refers to the derivative with respect to the simulation time r. Next we use 
the definition of V in terms of the thin links U and the fat links V according to Eq. ([6]), 
with the projection replaced by the normalization as given in Eq. ([5]), to get 



Re tr (S^V;,) = Re tr 



y.Wfr -L V 



n,u;ii 



-"n,fi 



dVr 



dVr 



(15) 
(16) 

(17) 



where the sum over m runs over all sites in the ii-v plaquettes attached to the link (n, /x) 
and p can be either /i or v. Next we express Vfj^-y in terms of the thin links U and smeared 
links V according to Eq. ([7]), and continue this procedure until we reach the level where only 
derivatives of the thin links are left 



Re tr {^^V,) 



Re tr 
Re tr 



(18) 
(19) 
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with 



y(2) _ y{l)^Y^ /of^x 

yr(2) ^ ?;(!) dVm,a;l3 

m,a;/3 ^^n,p;j/,At 

Here we can finally identify sIj^-* + slf-* + S^f-* = 5Scq/6U^ as the fermionic force term. 

Since the additional levels to Eq. ( ITSll are very simple modifications of the first level — 
only restricting the directions the sum runs over — let us restrict the following discussion 
to the first level. In terms of VL defined in Eq. ([T]), the n-APE link is then given by = 
To compute the inverse square root of Q = ^2+^2, we employ a method 
analogous to Morningstar and Peardon using the Cayley Hamilton theorem. A non-singular 
3x3 matrix Q can always be written as 

Q-y^ = f,l+f,Q + f^Q\ (21) 

where the scalars /o, /i, and /2 are functions of the traces of Q, and only. It is 
convenient to define 

Co = trQ ; ci = |trQ2 ; c2 = |trQ^ (22) 

The details of the functional dependence of the fi on the Cj is discussed in Sec. IIII A[ 

To use the strategy indicated in Eq. (fTSl) . we apply the chain rule until we are only left 
with derivatives of U or V , cycled to the right of the trace. For simplicity in the following 
we drop the index /i. 

Re trEV" = Re tr (^E^(fig-^/2) j 

= Re tr(Q-^/2sr2) + tr(Sn)/o + tr(QSn) A + ii{Q'^T.n)f2 (23) 
+/itr(SfiQ) + /2tr((Sr]Q + QSfi)Q) . 

Since the fi are scalar functions of the traces c„ we get 

f^ = Y. ■ (24) 

The computation of the derivatives hij = dfi/dcj is described in the next section. Defining 
Bn = bon + binQ + &2n <5^ Eq. (Es]) Icads to 

Re tv{Q-^^^m) + Re tr | [ ^ tr(5„S^])g" + /iSfi + ^(Sfig + Qm) g| . (25) 

Next, we define the sum in the square bracket as A and use that Q = Q^Q to get 

Re tr |(g"^/^S + An+ + A^n^)n^ = Re tr(m) (26) 
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with T = {A + + Q ^/^S. To compute the derivative of fl, we apply the chain rule 

again 

Re tr(S„^^V;i^^) = Re tr(r„_^n„^^) 

= Re tr r„,^ (1 - a)f7„,^ + Vn,u■,|,Vn+o,^,■uV^+o+f,,u■^ 

Now we can write down the final expression for S^^^ . First there is the "global" contribution 
from the thin link 

S« = (1 - a)r„,^ (27) 

and then there is the term that is multiplied with the derivatives of the l^'s, which we have 
to collect from the various contributions from neighboring sites 

t7 t7^ r -1-17 r 17^ -u r"!" 17^ v'^ 



(2) 

The next term in the force expression, S}j , is calculated the same way, by replacing S^with 
and with V^^i,, and similarly for slf-*. 



A. Derivative of the / constants 

This section describes the computation of the Cayley-Hamilton constants for the matrix 
and their derivatives with respect to the traces of Q". The starting point is the 
definition in Eq. (l2Tll . Since the matrix Q = Q^Q is a positive, Hermitian matrix, it can 
be diagonalized with non-negative eigenvalues gi. Eq. ( l2Tll then translates into an equation 
relating the eigenvalues to the coefficients /. 



(28) 



This equation has to be solved for /. Naturally, all expressions are symmetric in the eigen- 
values go, gi and g2- It turns out to be convenient to express the solution in terms of the 
symmetric polynomials of the square roots of the eigenvalues y/gl 



(l 


90 9o^ 


(fo) 




/ -1/2 \ 

9o \ 


1 


9i 9\ 


fi 








92 gi J 


\f2) 







u 



9o + V9i + V92 



^ V9m + V9(m + V9i92 
such that we get for the coefficients / the following results 

—wiv? + v) + uv"^ 



w 



V9o9i92 



(29) 



fo 



/2 



w{uv — w) 

—w — + 2uv 

w{uv — w) 
u 

w{uv — w) 



(30) 
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To compute the symmetric polynomials, we need a closed formula of the eigenvalues of Q 
in terms of its traces (which are independent of the basis) 

n + I n + 1 ^-^ 

i 

This leads to a cubic equation whose solution is most easily expressed in terms of 

5 = Ci/3-c2/18 ; = C2/2 - coCi/3 + ; ^ = arccos (^-^^ (31) 

with which the eigenvalues read for n = 0, 1, 2 

= I + 2v^ cos {^^ + {n- l)y ^ (32) 

Finally for their use in Eq. (l25ll , we need to compute the derivatives of the /j with respect 
to the traces Cj. To this end, we use the chain rule and write 

The matrix |^ is the inverse of the Vandermonde matrix |^ = Qj. Factoring out the 
common denominator d = 2w^{uv — we get for the symmetric matrix B = C/d 

Coo = -w^u^ + 'ivw^u^ + ?,v^wu^ - v^u^ - Aw^u^ - I2v^w'^u^ 

Cqi = —w'^u^ — v'^wu^ + v^u^ + Qvw'^u^ — bw'^u^ — v'^wu^ — 2v^u^ 
— 6f w u + lOvw u + 6v wu — 3w u — 6v w u + 2v w 

C02 = w'^u^ + v^wu^ - v'^u^ - Avw^u^ + Aw^u"^ + 3v^wu^ - Sv'^w'^u + vw^ 

Cii = -wu^ - v^u^ + Ivwu^ + Av^u^ - 5w^u^ - IQv'^wu* - Av\^ + IQvw'^u^ 
-Sw^"^ + Uv^wu"^ - 12vVu + 3vw^ 

C12 = wu^ + - bvwu^ - 2v^u^ + Aw^u^ + Qv^wu^ - Gvw'^u + 

C22 = —wu^ — v'^u^ + ?)vwv? — 3w^u . 

Note that this expression is singular only for w = 0, because uv — w > as long as one 
eigenvalue is non-zero. The pole in w = ^/gogigo, corresponds to at least one zero eigenvalue 
of Q. 



IV. NUMERICAL TESTS 



The calculation of the fermionic force is considerably more involved with HYP links than 
with stout links, but once the contribution from the smearin g is implemented, it can simply 
replace a stout smearing force routine. Since in Ref. 0, EsI-TigI. EtI . EsI . 19| stout smearing 
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was used in dynamical overlap simulations, we have tested n-HYP smearing in the same 
set-up. We have also implemented smearing in dynamical Wilson clover simulations. In the 
following we briefly summarize our experience with n-HYP links, concentrating mainly on 
algorithmic issues. 

A common algorithmic concern, independent of the fermionic formulation, is the potential 
occurrence of links with exactly zero determinant, detf2 = 0. In such case the normalized 
smeared link is ill-deflned and the force term diverges. In our test runs we found only once 
out of 10^° smeared link evaluations detf2 ^ 10^* and in single precision arithmetics that 
resulted in an exceptionally large force term. The corresponding conflguration was rejected 
and the simulation continued without problem. In double precision even this one occurrence 
could have been handled. The problem of det^l ~ might become much more severe at 
(even) coarser lattices but will disappear on the way to the continuum. 

A. Overlap tests 

Smeared links are a common ingredient to chiral fermion simulations because the cost 
of the Dirac operator application depends to a large part on the spectral properties of the 
kernel operator it is constructed from. To be speciflc, let us concentrate on Neuberger's 
overlap operator 

TTl 

Dov = {Ro - ^) [1 + 75e(/^(-i?o))] + m,, , (34) 

with Rq the radius of the Ginsparg-Wilson circle, rriov the bare quark mass, e the matrix 
sign function and h = 'j^d the Hermitian Dirac kernel operator at negative mass shift — -Rq- 
d is a Wilson like lattice Dirac operator, for our tests we take the planar operator discussed 
in Refs. 0,0|. 

Evaluating the action of the matrix sign function of /i on a vector is the expensive part 
of overlap fermion simulations. The standard technique is to compute the lowest few eigen- 
modes of h explicitly and use the spectral representation of the sign function for the corre- 
sponding sub-space. For the rest of the spectrum, a polynomial or rational approximation 
is used. In our test we use the Zolotarev rational approximation. The approximated sign 
function therefore reads 

^ ^ E 7^ (1 - E ^a) + signA P, (35) 

i * A A 

with Pa the projector on the low-mode of h{—Ro) with eigenvalue A. For each application 
of Dov on a vector a multi-shift system with the kernel operator has to be solved. Its 
condition number (and therefore the cost) decreases if the region from which the modes are 
treated explicitly is increased. Firstly, the lower bound of the Zolotarev approximation can 
be increased which yields a larger minimal q. Secondly the smallest mode of h"^ which has 
not been projected is larger. Thus the condition number of the whole system is smaller and 
it takes less iterations to solve the system of linear equations. A lower density of modes at 
the origin can therefore greatly reduce the cost of using the overlap operator. This can be 
achieved by constructing the kernel operator h from smeared links. 
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n-HYP 


2x stout 


3x stout 




262(19) 


604(38) 


291(22) 


(lAiol) 


0.31(1) 


0.16(1) 


0.28(1) 



Table II: The average number of applications of the kernel operator per application of the overlap 
for different kinds of fat links. We also give the average of the absolute value of the tenth eigenvalue, 
the largest eigenvalue for which we use the spectral representation. 



To estimate how smearing in the kernel operator affects the cost of overlap simulations, 
we compute one component of the overlap propagator at mass arriov = 0.03 on 30 12^ x 24 
dynamical clover configurations described in Section HV B[ On each configuration we project 
out the lowest 10 eigenmodes of the kernel operator h{—Ro). The number of iterations of 
the solver in the application of the sign function is averaged over the whole computation of 
the propagator. This gives the largest part of the cost of applying the overlap operator in a 
realistic situation. 

We compare kernel operators built from n-HYP links and stout links with two and three 
levels of smearing. The stout smearing parameter is set to 6p = 0.9 which is the value used 
in recent calculations using dynamical overlap fermions |l6l . Il8l |. while for n-HYP we use 
the standard HYP parameters. The results are displayed in Table HTl The largest projected 
mode is around 0.3 for both n-HYP and three levels of stout smearing whereas it is roughly 
half that for 2 levels of stout smearing. Because the smallest shift is much smaller than that, 
this also means that the condition number of the former is a factor two smaller than for the 
latter. 

This is also reflected in the cost of applying the overlap operator. Two levels of stout 
smearing is about twice as expensive as either n-HYP or three iterated stout smearings. 
However, the n-HYP smearing is more local than three levels of stout smearing and also 
comes with smaller coefficients mixing the original links with the staple. 



B. Wilson clover action tests 

We have implemented n-HYP smearing with two fiavor 0(a) improved Wilson fermions. 
For the gauge action we use the Liischer-Weisz action and fix the tadpole coefficient uq to 
be 0.875, the value that corresponds approximately to our simulation values. Note that this 
choice affects the gauge action only since the clover coefficient is left at its tree-level value 
Csw = 1-0; preliminary simulations indicated that this is close to the value that minimizes 



the width of the spectral gap of the Hermitian Dirac operator [26|. At /3 = 7.2 the lattice 
spacing is around 0.13 fm and simulations in smaller volumes (lattice size of 8^ x 12) predict, 
from the vanishing of the PC AC quark mass (see Fig. [3]), a critical hopping parameter of 
Kc = 0.12787(14). This value is surprisingly close to the one found in a quenched simulation 



with p-HYP smearing at similar lattice spacing [12||. The additive mass shift is dramatically 
smaller for HYP links than for thin link clover fermions, even with non-perturbative Csw- 
The remaining results quoted in this section are obtained from simulating a 12^ x 24 lattice 
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Figure 3: The PC AC quark mass from simulations at /? = 7.2 on 8^x12 lattices. The additive mass 
shift is only amadd = 0.090(4). 

at P = 7.2 and n = 0.1266. We have accumulated 500 trajectories after thermalization and 
measured eigenvalues of D and 75-D, n-HYP smeared Wilson loops, as well as pseudoscalar 
correlators every 5 trajectories. 

The three most expensive parts of the update are the calculation of the fermionic force 
(including inversions) , the gauge force and the n-HYP blocking (including the n-HYP force 
term). Of the total CPU time in these runs they consumed 75%, 13% and 11%, respec- 
tively. Thus, even with inexpensive fermion formulations such as Wilson the computational 
overhead of the n-HYP blocking is negligible. One should also note that the inversions of 
the Dirac operator are expected to be significantly cheaper than in a comparable physical 
situation with thin link clover fermions if the latter is possible at all. 

A few remarks on the details of our simulation are in place: Each trajectory was split 
in 25 steps using a Sexton- Weingarten integrator and the same integrator on a finer time 
scale was also used for the gauge force. This resulted in an acceptance rate of 0.879(7). On 
32 nodes of a Myrinet cluster with 2GHz Xeon processors, one unit length trajectory took 
about 17 minutes to complete. 



From fits to the static quark potential [2]J we extract the Sommer scale [22| ro/a 



3.903(25) and the string tension ay/a = 0.2897(26). The bare current quark mass am = 
0.0451(9) is in good agreement with the small volume data shown in Fig. [H indicating small 
cutoff effects. Assuming tq = 0.5 fm we obtain a lattice spacing of 0.128(1) fm and bare 
current quark mass of 69.4(1.5) MeV. We find a ratio of pseudoscalar to vector meson mass 
of 0.57(3). 

The behavior of the low-lying eigenmodes of the Dirac operator are of particular interest 
if one wants to determine the degree of chiral symmetry that is retained at finite lattice 
spacing. Also, the lowest eigenvalue of the Hermitian Dirac operator 75D is important for 
algorithmic reasons as it determines the spectral gap and indicates the lowest bare quark 
mass potentially accessible at a given lattice spacing and volume 
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Figure 4: The infrared spectrum of the n-HYP Wilson clover Dirac operator (left panel), and the 
spectral gap as determined from the lowest eigenmode of the Hermitian 75D Dirac operator on the 
same configurations (right panel). The dashed line in both plots indicates the median of the spectral 
gap distribution. Data are from 100 12^ x 24 configurations with lattice spacing a « 0.13 fm and 
bare quark mass m 69 MeV. 



The left panel of Fig [4] shows the infrared spectrum (lowest 40 eigenmodes) of the n- 
HYP Dirac clover operator from 100 configurations. The complex spectrum has a rather 
well defined left boundary that follows a circle with only a few real modes violating that 
bound. This indicates that even at this coarse lattice spacing much smaller quark masses 
can be reached without encountering exceptional configurations. A similar plot is published 
in Ref. 23|| showing the eigenmodes of the chirally improved CI Dirac operator in two fiavor 



dynamical simulations at similar lattice spacing and volume, though about 50% lighter quark 
masses. The spectrum in Fig. [4] compares well with that plot, showing similar widening of 
the Ginsparg-Wilson circle for the two actions. 

A more direct measure of the accessible mass range is the spectral gap, i.e. the distribution 
of the smallest magnitude eigenvalue of the Hermitian Dirac operator 'y^D [^l- This 
distribution is plotted on the right panel of Fig. [4] with the median fl = 63.3(4) MeV 
marked by a dotted line. The ratio of the median and the PCAC quark mass is indicative of 
the renormalization factor ZA/{ZmZp) 

liEi 

and the value we obtain, 0.91, signals small 

perturbative corrections. 

To facilitate comparison with similar distributions in Ref. the data is plotted with the 
same bin size, A/x = 1.5 MeV. The width of the distribution, defined as half the width of the 
shortest interval that contains 68.3% of the data, is cr = 5.5(6) MeV. Since the distribution in 
Fig. [His quite asymmetric, it is more physical to define the width as the interval to the left of 
the median that contains 68.3% of the data. This modified definition gives a = 4.6(6) MeV. 



16 



One expects that simulations at quark masses of about 3a are safe, which corresponds to 
^ 15MeV at this volume and lattice spacing. In Ref. ^ it was found that, at least for 
unimproved thin link Wilson fermions [2j|, the width of the spectral gap scales inversely 
with the square root of the volume, ay/V ~ 1. Assuming the same scaling law in our case 
we find aW ~ 0.61 — 0.73, depending on the definition of the width. The decrease signals 
the improved chiral properties of the smeared Dirac operator. The median /i of the lowest 
mode of the Hermitian operator is also indicated in the complex Dirac spectrum, where it 
is tangent to the circle that bounds the spectrum. 



V. CONCLUSIONS 

We have successfully implemented and tested a gauge link smearing scheme that inherits 
the good properties of HYP smearing (locality and removal of dislocations) while still being 
suitable for MD based algorithms. This is achieved by replacing the projection steps in 
the original HYP construction by normalizations to the corresponding unitary group, thus 
allowing the calculation of the molecular dynamics force for fermions coupled to the smeared 
links. We have tested the n-HYP smearing with overlap fermions where we found that they 
can be simulated as effectively as 3 level stout smeared fermions and about twice as fast 
as 2-level stout smeared ones. We have also implemented the smearing with Wilson -clover 
fermions. Our preliminary tests indicate that light quarks, even as low as 15 MeV, can be 
simulated at a ~ 0.13 fm lattices and volumes aL > 1.6 fm. In addition, the smoothness of 
the smeared links speed up the inversion of the Dirac operator. 

We have reported only preliminary results here. The volume, quark mass, and lattice 
spacing dependence of Wilson clover simulations with n-HYP links will be tested in the 
future. 
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